Fast Biped Walking with a Sensor-driven Neuronal Controller and Real-time Online Learning

نویسندگان

  • Tao Geng
  • Bernd Porr
  • Florentin Wörgötter
چکیده

In this paper, we present our design and experiments on a planar biped robot under the control of a pure sensor-driven controller. This design has some special mechanical features, for example small curved feet allowing rolling action and a properly positioned center of mass, that facilitate fast walking through exploitation of the robot’s natural dynamics. Our sensor-driven controller is built with biologically inspired sensorand motor-neuron models, and does not employ any kind of position or trajectory tracking control algorithm. Instead, it allows our biped robot to exploit its own natural dynamics during critical stages of its walking gait cycle. Due to the interaction between the sensor-driven neuronal controller and the properly designed mechanics of the robot, the biped robot can realize stable dynamic walking gaits in a large domain of the neuronal parameters. In addition, this structure allows the use of a policy gradient reinforcement learning algorithm to tune the parameters of the sensor-driven controller in real-time, during walking. This way RunBot can reach a relative speed of 3.5 leg lengths per second after only a few minutes of online learning, which is faster than that of any other biped robot, and is also comparable to the fastest relative speed of human walking. KEY WORDS—dynamic biped, reflex, neuronal controller, online learning, fast walking The International Journal of Robotics Research Vol. 25, No. 3, March 2006, pp. 243-259 DOI: 10.1177/0278364906063822 ©2006 Sage Publications

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast biped walking with a reflexive controller and real-time policy searching

In this paper, we present our design and experiments of a planar biped robot (“RunBot”) under pure reflexive neuronal control. The goal of this study is to combine neuronal mechanisms with biomechanics to obtain very fast speed and the on-line learning of circuit parameters. Our controller is built with biologically inspired sensorand motor-neuron models, including local reflexes and not employ...

متن کامل

Learning CPG-based Biped Locomotion with a Policy Gradient Method: Application to a Humanoid Robot

In this paper we describe a learning framework for a central pattern generator (CPG)-based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve CPG-based biped walking with a 3D hardware humanoid and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropr...

متن کامل

Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid

This paper describes a learning framework for a central pattern generator based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve biped walking with a 3D hardware humanoid, and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback contro...

متن کامل

Experimental realization of dynamic walking of the biped humanoid robot KHR-2 using zero moment point feedback and inertial measurement

This paper describes a novel control algorithm for dynamic walking of biped humanoid robots. For the test platform, we developed KHR-2 (KAIST Humanoid Robot-2) according to our design philosophy. KHR-2 has many sensory devices analogous to human sensory organs which are particularly useful for biped walking control. First, for the biped walking motion, the motion control architecture is built a...

متن کامل

Robust Trajectory Free Model Predictive Control of Biped Robots with Adaptive Gait Length

This paper employs nonlinear disturbance observer (NDO) for robust trajectory-free Nonlinear Model Predictive Control (NMPC) of biped robots. The NDO is used to reject the additive disturbances caused by parameter uncertainties, unmodeled dynamics, joints friction, and external slow-varying forces acting on the biped robots. In contrary to the slow-varying disturbances, handling sudden pushing ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • I. J. Robotics Res.

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2006